AITopics | taxonomy induction

taxonomy induction

Chain-of-Layer: Iteratively Prompting Large Language Models for Taxonomy Induction from Limited Examples

Zeng, Qingkai, Bai, Yuyang, Tan, Zhaoxuan, Feng, Shangbin, Liang, Zhenwen, Zhang, Zhihan, Jiang, Meng

arXiv.org Artificial Intelligence, Feb-11-2024

Automatic taxonomy induction is crucial for web search, recommendation systems, and question answering. Manual curation of taxonomies is expensive in terms of human effort, making automatic taxonomy construction highly desirable. In this work, we introduce Chain-of-Layer, an in-context learning framework designed to induce taxonomies from a given set of entities. Chain-of-Layer breaks the task down into selecting relevant candidate entities in each layer and gradually building the taxonomy from top to bottom. To minimize errors, we introduce the Ensemble-based Ranking Filter, which reduces the hallucinated content generated at each iteration. Through extensive experiments, we demonstrate that Chain-of-Layer achieves state-of-the-art performance on four real-world benchmarks.
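The top-down, layer-by-layer construction described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: `build_taxonomy_by_layer`, `propose`, `score`, and `threshold` are hypothetical names, the `propose` callable stands in for the language model prompt, and a plain score threshold stands in for the Ensemble-based Ranking Filter, whose details the abstract does not give.

```python
from typing import Callable, List, Set, Tuple

Edge = Tuple[str, str]  # (parent, child)


def build_taxonomy_by_layer(
    root: str,
    entities: Set[str],
    propose: Callable[[List[str], Set[str]], List[Edge]],
    score: Callable[[Edge], float],
    threshold: float = 0.5,
) -> List[Edge]:
    """Grow a taxonomy one layer at a time, starting from the root.

    `propose` is a stand-in for the model call: given the current layer and
    the entities not yet placed, it returns candidate (parent, child) edges.
    `score` is a stand-in for a filtering step (assumption: higher is better).
    """
    taxonomy: List[Edge] = []
    frontier = [root]
    remaining = set(entities) - {root}
    while frontier and remaining:
        next_frontier: List[str] = []
        for parent, child in propose(frontier, remaining):
            # Reject edges that mention entities outside the given set or
            # entities that were already placed (simulated hallucinations).
            if parent not in frontier or child not in remaining:
                continue
            # Reject edges the filter considers unreliable.
            if score((parent, child)) < threshold:
                continue
            taxonomy.append((parent, child))
            remaining.discard(child)
            next_frontier.append(child)
        frontier = next_frontier
    return taxonomy


# Toy stand-ins for illustration only; "unicorn" is not in the entity set.
TOY_PARENT = {"dog": "animal", "cat": "animal", "poodle": "dog", "unicorn": "animal"}


def toy_propose(frontier: List[str], remaining: Set[str]) -> List[Edge]:
    return [(p, c) for c, p in TOY_PARENT.items() if p in frontier]


def toy_score(edge: Edge) -> float:
    return 1.0


edges = build_taxonomy_by_layer(
    "animal", {"animal", "dog", "cat", "poodle"}, toy_propose, toy_score
)
print(edges)
```

In this toy run the proposed edge to "unicorn" is dropped because it names an entity outside the input set, which is the kind of error a per-iteration filter is meant to catch.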

entity list, taxonomy, taxonomy induction, (13 more...)

arXiv:2402.07386

Country:

North America > United States > Indiana > St. Joseph County > Notre Dame (0.05)
North America > United States > Washington > King County > Seattle (0.04)
Europe > North Macedonia > Skopje Statistical Region > Skopje Municipality > Skopje (0.04)
(3 more...)

Genre: Research Report > New Finding (0.68)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.91)

Path Based Hierarchical Clustering on Knowledge Graphs

Pietrasik, Marcin, Reformat, Marek

arXiv.org Artificial Intelligence, Sep-27-2021

Knowledge graphs have emerged as a widely adopted medium for storing relational data, making methods for automatically reasoning with them highly desirable. In this paper, we present a novel approach for inducing a hierarchy of subject clusters, building upon our earlier work on taxonomy induction. Our method first constructs a tag hierarchy and then assigns subjects to clusters on this hierarchy. We quantitatively demonstrate our method's ability to induce a coherent cluster hierarchy on three real-world datasets.

cluster hierarchy, hierarchy, knowledge graph, (14 more...)

arXiv:2109.13178

Country:

South America > Uruguay > Montevideo > Montevideo (0.04)
South America > Colombia (0.04)
North America > Cuba > La Habana Province > Havana (0.04)
(8 more...)

Genre: Research Report (0.70)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (0.79)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.51)

TiFi: Taxonomy Induction for Fictional Domains [Extended version]

Chu, Cuong Xuan, Razniewski, Simon, Weikum, Gerhard

arXiv.org Artificial Intelligence, Jan-29-2019

Taxonomies are important building blocks of structured knowledge bases, and their construction from text sources and Wikipedia has received much attention. In this paper, we focus on the construction of taxonomies for fictional domains, using noisy category systems from fan wikis or text extraction as input. Such fictional domains are archetypes of entity universes that are poorly covered by Wikipedia, much like enterprise-specific knowledge bases or highly specialized verticals. Our fiction-targeted approach, called TiFi, consists of three phases: (i) category cleaning, by identifying candidate categories that truly represent classes in the domain of interest, (ii) edge cleaning, by selecting subcategory relationships that correspond to class subsumption, and (iii) top-level construction, by mapping classes onto a subset of high-level WordNet categories. A comprehensive evaluation shows that TiFi is able to construct taxonomies for a diverse range of fictional domains, such as Lord of the Rings, The Simpsons, or Greek Mythology, with very high precision, and that it outperforms state-of-the-art baselines for taxonomy induction by a substantial margin.

category, machine learning, natural language, (18 more...)

arXiv:1901.10263

Country:

Europe > Germany > Saarland > Saarbrücken (0.04)
Europe > Slovenia > Coastal-Karst > Municipality of Koper > Koper (0.04)
Europe > Italy > Piedmont > Turin Province > Turin (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.82)

Industry:

Leisure & Entertainment (1.00)
Media > Film (0.47)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)
(2 more...)

280 Birds With One Stone: Inducing Multilingual Taxonomies From Wikipedia Using Character-Level Classification

Gupta, Amit (Ecole Polytechnique Fédérale de Lausanne) | Lebret, Rémi (Ecole Polytechnique Fédérale de Lausanne) | Harkous, Hamza (Ecole Polytechnique Fédérale de Lausanne) | Aberer, Karl (Ecole Polytechnique Fédérale de Lausanne)

AAAI Conferences, Feb-8-2018

We propose a novel, fully automated approach to inducing multilingual taxonomies from Wikipedia. Given an English taxonomy, our approach first leverages the interlanguage links of Wikipedia to automatically construct training datasets for the is-a relation in the target language. Character-level classifiers are trained on the constructed datasets and used in an optimal path discovery framework to induce high-precision, high-coverage taxonomies in other languages. Through experiments, we demonstrate that our approach significantly outperforms the state-of-the-art, heuristics-heavy approaches for six languages. As a consequence of our work, we release what is presumably the largest and most accurate multilingual taxonomic resource, spanning over 280 languages.
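The first step named in this abstract, building target-language training data from an English taxonomy via interlanguage links, might look roughly like the sketch below. It is an illustration under assumptions rather than the authors' pipeline: `project_isa_edges` is a hypothetical helper, the link table is toy data, and how the paper labels positive and negative examples is not stated in the abstract.

```python
from typing import Dict, List, Tuple

Edge = Tuple[str, str]  # (child, parent) in an is-a relation


def project_isa_edges(
    english_edges: List[Edge],
    interlanguage_links: Dict[str, str],
) -> List[Edge]:
    """Map English is-a edges into a target language.

    An edge is kept only when both of its endpoints have an interlanguage
    link; edges with a missing endpoint are skipped.
    """
    projected: List[Edge] = []
    for child, parent in english_edges:
        target_child = interlanguage_links.get(child)
        target_parent = interlanguage_links.get(parent)
        if target_child is not None and target_parent is not None:
            projected.append((target_child, target_parent))
    return projected


# Toy data for illustration only (English to French); "Poodle" has no link.
english_edges = [("Dog", "Mammal"), ("Mammal", "Animal"), ("Poodle", "Dog")]
links_to_french = {"Dog": "Chien", "Mammal": "Mammifère", "Animal": "Animal"}

projected = project_isa_edges(english_edges, links_to_french)
print(projected)
```

The projected pairs would then serve as labeled examples for a classifier in the target language; the edge involving "Poodle" is dropped because one endpoint has no counterpart in the toy link table.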

machine learning, natural language, text classification, (22 more...)

Thirty-Second AAAI Conference on Artificial Intelligence

Country:

North America > Canada (1.00)
Europe (0.93)

Genre: Overview (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Communications > Social Media (0.87)
Information Technology > Artificial Intelligence > Natural Language > Text Classification (0.53)
(2 more...)